A Unified Approach for Modeling and Optimization of Energy, Makespan and Reliability for Scientific Workflows on Large-Scale Computing Infrastructures
نویسندگان
چکیده
Green computing has received significant attention in the past few years. Although some research has addressed cooling and energy usage reduction in large data-centers [1], they do not control how resources are used by applications. Scientific workflows are a useful representation for managing the execution of large-scale computations on high performance computing (HPC) and high throughput computing (HTC) platforms [2]. In scientific workflow applications, resource provisioning and utilization optimizations have been investigated to reduce energy consumption on Cloud infrastructures [3], [4]. However, existing research is largely limited to the measurement of energy usage according to resource utilization when running a program on an execution node. Furthermore, most existing optimization techniques for workflows are limited to single objectives (e.g. makespan), and some can deal with only two objectives. There does not exist an approach that deals with an arbitrary number of objectives and no scheduling technique explored tradeoffs among makespan, energy consumption, and reliability. We recently proposed [5] an energy consumption model for analyzing and profiling energy usage that addresses resource utilization, data movement, and I/O operations. Although our model assembles several models (computing, networking and storage systems) validated in a real execution environment, it still makes strong assumptions on the resource characteristics (e.g. single core homogeneous virtual machines), and ignores external loads. In this work, we propose 1) an extension of our energy consumption model to address real large-scale infrastructure conditions (e.g. heterogeneity, resource unavailability, external loads); 2) the validation of the model in a fully instrumented platform able to measure the actual temperature and energy consumed by computing, networking, and storage systems; and 3) a multi-objective optimization approach to explore tradeoffs among makespan, energy consumption, and reliability for multi-objective workflow scheduling.
منابع مشابه
Multi-objective list scheduling of workflow applications in distributed computing infrastructures
Executing large-scale applications in distributed computing infrastructures (DCI), for example modern Cloud environments, involves optimization of several conflicting objectives such as makespan, reliability, energy, or economic cost. Despite this trend, scheduling in heterogeneous DCIs has been traditionally approached as a single or bi-criteria optimization problem. In this paper, we propose ...
متن کاملA Clustering Approach to Scientific Workflow Scheduling on the Cloud with Deadline and Cost Constraints
One of the main features of High Throughput Computing systems is the availability of high power processing resources. Cloud Computing systems can offer these features through concepts like Pay-Per-Use and Quality of Service (QoS) over the Internet. Many applications in Cloud computing are represented by workflows. Quality of Service is one of the most important challenges in the context of sche...
متن کاملA Adapting Scientific Workflow Structures Using Multi-Objective Optimisation Strategies
Scientific workflows have become the primary mechanism for conducting analyses on distributed computing infrastructures such as grids and clouds. In recent years, the focus of optimisation within scientific workflows has primarily been on computational tasks and workflow makespan. However, as workflow-based analysis becomes ever more data intensive, data optimisation is becoming a prime concern...
متن کاملGreen Energy-aware task scheduling using the DVFS technique in Cloud Computing
Nowdays, energy consumption as a critical issue in distributed computing systems with high performance has become so green computing tries to energy consumption, carbon footprint and CO2 emissions in high performance computing systems (HPCs) such as clusters, Grid and Cloud that a large number of parallel. Reducing energy consumption for high end computing can bring various benefits such as red...
متن کاملImproving the palbimm scheduling algorithm for fault tolerance in cloud computing
Cloud computing is the latest technology that involves distributed computation over the Internet. It meets the needs of users through sharing resources and using virtual technology. The workflow user applications refer to a set of tasks to be processed within the cloud environment. Scheduling algorithms have a lot to do with the efficiency of cloud computing environments through selection of su...
متن کامل